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ABSTRACT 

The Baryon Acoustic Oscillation (BAO) feature in the power spectrum of galaxies provides 
a standard ruler to probe the accelerated expansion of the Universe. The current surveys cov- 
ering a comoving volume sufficient to unveil the BAO scale are limited to redshift z < 0.7. 
In this paper, we study several galaxy selection schemes aiming at building an emission-line- 
galaxy (ELG) sample in the redshift range 0.6 < z < 1.7, that would be suitable for future 
BAO studies using the Baryonic Oscillation Spectroscopic Survey (BOSS) spectrograph on 
the Sloan Digital Sky Survey (SDSS) telescope. We explore two different colour selections 
using both the SDSS and the Canada France Hawai Telescope Legacy Survey (CFHT-LS) 
photometry in the u, g, r, and / bands and evaluate their performance selecting luminous ELG. 
From about 2,000 ELG, we identified a selection scheme that has a 75 percent redshift mea- 
surement efficiency. This result confirms the feasibility of massive ELG surveys using the 
BOSS spectrograph on the SDSS telescope for a BAO detection at redshift z ~ 1, in particular 
the proposed eBOSS experiment, which plans to use the SDSS telescope to combine the use 
of the BAO ruler with redshift space distortions using emission line galaxies and quasars in 
the redshift 0.6 < z < 2.2. 

Key words: cosmology - large scale structure - galaxy - selection - baryonic acoustic oscil- 
lations 



1 INTRODUCTION 

With the discovery of the acceleration of the expansion of the uni- 
verse ( [Riess et al.|1998[[Perlmutter et al.|1999| ), possibly driven by a 
new form of energy with sufficient negative pressure, recent results 
have concluded that ~ 96 percent of the energy density of the uni- 



verse is in a form not conceived by the Standard Model of particle 
physics and not interacting with the photons, hence dubbed "dark". 
Lying at the heart of this discovery is the distance-redshift rela- 
tion mapped by the type la supernovae (Snia) combined with the 
temperature power spectrum of the cosmic microwave background 
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Table 1. Requirements for measuring at 1 percent the BAO signal, h is the required density to overcome shot noise. The observed density [galaxy deg~^] is 
the projection of h on the sky. The Sample variance area required [deg^] corresponds to the area necessary to obtain an effective volume of 1 Gpc^ at 
h ^ 0.063 h Mpc"^ and k2 ^ 0.12 h Mpc"^ given the value n. A/req is the number of thousands of redshifts required to detect BAO at ki and k2: it is the 
product of the required observed density multiplied by the area required, 'req.' stands for required. 



redshift 


Shot Noise req. 


observed density req. 


Sample variance 


A^req 


range 


n(ki) 


n(k2) 


[deg 


-2] 


area req. [deg^] 


[10^ redshifts] 






for k\ 


for k2 




for ki for k2 


[0.3,0.6] 


1.0 


2.1 


33 


71 
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2.9 
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195 421 


[1.2,1.5] 


1.5 


3.2 


164 


354 


1227 


201 435 


[1.5,1.8] 


1.7 


3.6 


273 


589 


1041 


284 613 



fluctuations. Since the first detections, there has been a huge incre- 
ment of data up to redshift z - 1 (,Riess et al.|199 8|Perlmu tter et al.| 
1999|Astier et al.|2006|Wood-Vasey et al.|2C)07|[R iess et al .|2004| 



Riess et al.|2007nDawson et al.|2009[|Riess et al.|201 1 ) The cur- 



plans to map the high-redshift universe in the redshift range 0.6 < 
z < 2 using the largest possible volume; see BigBOSS ( |Schlegel| 
et al.||201l| ), PFS-Su]V[IR^ and EUCLIE0 To achieve this goal, 



rent precision and accuracy required to obtain deeper insight on the 
cosmological model using Snia is limited by the systematic errors 
of this probe; therefore a joint statistical analysis with other probes 
is mandatory to assess a firm picture of the cosmological model. 

Corresponding to the size of the well-established sound hori- 
zon in the primeval baryon-photon plasma before photon decou- 
pling ( [Peebles & Yu|1970l , the BAO scale provides a standard ruler 
allowing for geometric probes of the global metric of the universe. 
In the late-time universe it manifests itself in an excess of galaxies 
with respect to an unclustered (Poisson) distribution at the comov- 
ing scale r ~ 100/i~^Mpc — corresponding to a fundamental wave 
mode k ~ 0.063/iMpc~^ The value of this scale at higher redshift 
is accurately measured by the peaks in the CIVIB power spectrum 
{e.g. [Komatsu et al.|2009l|2011 ). Galaxy clustering and CMB ob- 
servations therefore allow for a consistent comparison of the same 
physical scale at diff'erent epochs. 

The first detection of the 'local' BAO { [Cole et al. 120051 |Eisen-| 
[stein et al.|2005| ) were based on samples at low redshift z < 0.4. 
Further analysis on a larger redshift range (z > 0.5) and a wider 
area confirm the first result, reducing the errors by a factor of 2 
( [Percival et al. [20 10[ [Blake et al.|20lTl ). IMeasurements of the BAO 
feature have thus become an important motivation for large galaxy 
redshift surveys; the small amplitude of the baryon acoustic peak, 
and the large value of tbao^ require comoving volumes of order of 
~ IGpc^/z"^ and at least 10^ galaxies to ensure a robust detection 
(g.g. [Tegmark|1997a[[BTake & Glazebrook|2003| ). 

BAO studies using luminous red galaxies (LRG) are currently 
being pushed to z = 0.7 by the Baryonic Oscillation Spectroscopic 
Survey (BOSS) experiment as part of the Sloan Digital Sky Sur- 
vey III (SDSS-III) survey (Eisen stein et al.[[20rij . So far, with a 
third of the spectroscopic data, the BAO feature has been measured 
at z = 0.57 with a 6.1cr significance ([ Anderson et al.|20"T2j ). The 
final data set, which will be completed by mid-2014, will have a 
mean galaxy density of about 150 galaxies per square degree over 
10,000 deg^. Recently, the WiggleZ experiment has obtained a sig- 
nificant ~ 4.9cr detection of the BAO peak at z = 0.6, by combin- 
ing information from three independent galaxy surveys: the SDSS, 
the 6-degree Field Galaxy Survey (6dFGS) and the WiggleZ Dark 
Energy Survey ( [Blake et al.| 2011 ). In contrast to SDSS, WiggleZ 
has mapped the less biased, more abundant emission line galaxies 
( [Drinkwater et al.|2010| ). 

The next generation of cosmological spectroscopic surveys 



suitable tracers covering this redshift range are needed. Above 
z ~ 0.6 the number density of LRGs decreases while the bulk of 
galaxy population is composed of star forming galaxies ( Abraham[ 
[et al.|1996[[Ilbert et al.|2006j ); it is therefore compelling to build a 
large sample of such type of galaxies, which allows one to cover a 
large area and hence a large volume. The main challenges for fu- 
ture BAO surveys is to efficiently select targets for which a secure 
redshift can be measured within a short exposure time. Contrary 
to continuum-based LRG survey, the observational strategy of next 
generation surveys such as BigBOSS, PFS-SulVLIRe, and EUCLID 
is based on redshift measurements using emission lines, which are 
a common feature of star-forming galaxies. In this paper we focus 
on targeting strategies for selecting luminous ELGs at 0.6 < z < 1 .7 
using optical photometry, and we test our strategies using the BOSS 
spectrograph on the SDSS telescope (| Gunn et al.|2006| ). 

The plan of the paper is as follow. In section [2] we derive the 
necessary ELG redshift distribution to detect the BAO feature. In 
section[3]we explain how the ELG selection criteria were designed 
using diff'erent photometric catalogs, based on the performances of 
the BOSS spectrograph. In section|4]we compare observed spectra 
issued from this selection with simulations and we discuss the effi- 
ciency of the proposed selection schemes. In sectionj5]we discuss 
the main physical properties of the ELGs. In section [6] we present 
the redshift distribution of the observed ELGs and how to improve 
the selection. In appendix [A| we display a representative set of the 
spectra observed. 

Throughout this study we assume a flat ACDM cosmology 
characterized by erg) = (0.27,0.96,0.81). Magnitudes are 

given in the AB system. 



2 BARYON ACOUSTIC OSCILLATIONS 

2.1 Density and geometry requirements 

In order to constrain the distance-redshift relation at z > 0.6 us- 
ing the BAO, we need a galaxy sample that covers the volume of 
the universe observable at this redshift. In this section we derive 
the required mean number density of galaxy, h{z), and the area to 
be covered in order to observe the BAO feature at the one percent 
level. 



http ://sumire . ipmu .j p/en/ 
http://sci.esa.int/euclid 
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Figure 1. PfN - hP, ratio of the fiducial power spectrum over the shot noise as a function of redshift at ^ ^ 0.063 (left) and at ^ ^ 0.12 (center) for 
^ e [1, 3.5] 10~^ /z^Mpc"^. (Right) Effective volume in Gpc^ sampled as a function of sky coverage, for five different redshift bins from 0.3 to 1.8, the 
sampling density used for each bin is given in the column n{k2) ofTable[l] To reach a comoving volume of 1 Gpc h ^ over the redshift range [0.6, 0.9], it is 
necessary to cover about 2500 deg^. 



The statistical errors in the measure of the power spectrum of 
galaxies P{k, z), evaluated at redshift z and at scale k, arise from 
sample variance and shot noise ( Kaiser 1986 ). Denoting the latter 
as N{z) - l/^(z), to measure a significant signal the minimal re- 
quirement is 



P(k,z) 
N{z) 



>2. 



(1) 



As the amplitude of the power spectrum decreases with redshift, 
the required density increases with redshift. e.g., dX z - 0.6, we 
need a galaxy density of = 2.1 x 10~^ h?M^c~^ ; at z = 1.5, 
h - 10""^ /z^Mpc"^. The full trend in redshift bins is given in 
Table [T] and in Figures [T] a) and b) which show equation ([T]) as a 
function of redshift for k ^ 0.063 h IVIpc"^ and k ^ 0.12 h IVlpc"^ 
(the location of the first and the second harmonics of the BAO peak 
in the linear power spectrum). 

In order to minimize the sample variance, we must sample the 
largest possible volume (a volume of 1 Gpc^ roughly corre- 
sponds to a precision in the BAO scale measurement of 5 percent). 
To quantify this calculation, we use the effective volume sampled 
Veff, defined as (Tegmark 1997b) 



)b\z)P{r,k) 



(r)b\z)P(r,k) 



(2) 



In this calculation, we assume a linear bias according to the DEEP2 
study by [Coil et alT| p008| ) that varies according to the redshift as 
b(z) = Z?o(l + z), with b(z = 0.8) = 1.3. The bias could be larger 
for the more luminous ELGs, that are thought to be the progenitors 
of massive red galaxies (fC ooper et al.|2008 ). We shall evaluate the 
bias of ELGs more precisely in a future paper. The corresponding 
area to be surveyed in order to reach VefF ~ IGpc^ is shown 
in Table [T] setting redshift bins of width Az = 0.3 from z = 0.3 to 
z = 1.8. The Figure[T]c) shows the behavior of Veff as a function of 
the area for a given slice of redshift with h given in the third column 
of Table[T] For the redshift range [0.6, 0.9] the survey area must be 
>2,500 deg^. For the redshift range [0.9, 1.2] the survey area must 
be > 1,600 deg^. The observation of [0.6, 1.7] with a single galaxy 
selection thus needs 2,500 deg^ to sample the BAO at all redshifts. 



Reconstruction of the galaxy field 

To obtain a high precision on the measure of the BAO scale, it is 
necessary to correct the 2-point correlation function from the dom- 
inant non-linear effect of clustering. The bulk flows at a scale of 
20 Mpc that form large scale structures smear the BAO peak: it 
is smoothed by the velocity of pairs (At redshift 1 the rms displace- 
ment for biased tracers due to bulk flows is 8.5 Mpc in rea l 
space and 17 Mpc in redshift space) (Eisenstein et al.|2007a b I. 



Reconstruction consists in correcting this smoothing effect. 
The key quantity that allows reconstruction on a data sample is the 
smoothing scale used to reconstruct the velocity field and should 
be as close to 5 Mpc as possible in order to measure the bulk 
flows without being biased by other non-linear effects that occur on 
smaller scales. 

The reconstruction algorithm applied on the SDSS-II Data Re- 
lease 7 fAbazaji an et al.|2009] ) LRG sample sharpens the BAO fea- 
ture and reduces the errors from 3.5 percent to 1.9 percent. This 
sample has a density of tracers of 10 ""^ Mpc~^ and the optim um 
smoothing applied is 15 Mpc ( ^Padmanabhan et al.|2012| l. On 
the SDSS-III/BOSS data in our study (different patches cover 3,275 
deg^ on a total of 10,000 deg^), reconstruction sharpens the BAO 
peak allowing a detection at high significance, but does not signif- 
icantly improve the precision on the distance measure due to the 
gaps in the current survey (see jAnderson et al.|2012| . 

To allow an optimum reconstruction using a smoothing three 
times smaller (5 Mpc) it is necessary to have a dense and con- 
tiguous galaxy survey : gaps in the survey footprint smaller than 
1 Mpc and a sampling density higher than 3 x 10""^ Mpc~^. 
This setting should reduce the sample variance error on the acous- 
tic scale by a factor four. 



2.2 Observational requirements 

A mean galaxy density of 3 x 10~^ Mpc~^ can be reached by 
a projected density of 162 galaxies deg~^ with 0.6 < z < 0.9, 261 
deg"^ with 0.9 < z < 1.2, 354 with 1.2 < z < 1.5, and 589 with 
1.5 < z < 1.8. Considering a simple case where a survey is divided 
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in three depths, the shallow one covering 2,500 deg^ should contain 
419,000 galaxies ; the medium 421,000 galaxies over 1,600 deg^ 
; and the deep 435,000 galaxies over 1,200 deg^. This represents 
a survey containing 1,350,000 measured redshifts in the redshift 
range [0.6, 1.5]. The challenge is to build a selection function that 
enhances the observation of these projected densities. 

Given a ground-based large spectroscopic program that mea- 
sures 1.5 X 10^ spectra (it corresponds to about 4 years of dark time 
operations on SDSS telescope dedicated to ELGs), the challenge is 
to define a selection criterion that samples galaxies to measure the 
BAO on the greatest redshift range possible. We define the selec- 
tion efficiency as the ratio of the number of spectra in the desired 
redshift range and the number of measured spectra. The example 
in the previous paragraph needs a selection with an efficiency of 
1.35/1.5 -90 percent. 



2.3 Previous galaxy targets selections 

To reach densities of tracers > 10""^ Mpc~^ at z > 0.6 with a high 
efficiency, a simple magnitude cut is not enough. Such a selection 
would be largely dominated by low-redshift galaxies. The use of 
colour selections is necessary to narrow the redshift range of the 
target selection for observations. 

SDSS-I/II galaxies are selected with visible colours in the red 
end of the colour distribution of galaxies, resulting in a sample of 
LRG and not ELGs ( [Eisenstein et al.|2()0T| ). The projected density 
of LRG is ~ 120 deg~^ with a peak in the redshift distribution at 
z ~ 0.35. With the SDSS-I^I LRG sample, the distance redshift 
relation was reconstructed at 2 percent at z = 0.35. 

BOSS has currently completed about half of its observation 
plan. The tracers used by BOSS are, as SDSS-I/II LRG, selected 
in the red end of the color distribution of galaxies, they are called 
CMASS (it stands for 'constant mass' galaxies) and the selection 
will be detailed in Padmanabhan et al. in prep. (2012). The current 
BAO detection using the data release 9 (a third of the observation 
plan) with the CMASS tracers at z ~ 0.57 has a 6.1 cr significance 
( [Anderson et al.|2012| ). 

WiggleZ blue galaxies are selected using UV and visible 
colours: they have a density of 240 galaxies deg~^ and a peak in 
the redshift distribution around z - 0.6 ( ^Drinkwater et al.|2010| ). 
The WiggleZ experiment has obtained a 4.9cr detection of the BAO 
peak at z = 0.6 (Bl ake et al.|2011| ). 

At their peak density, both of these BAO surveys reach a 
galaxy density of 3 x 10~^ Mpc~^, which guarantees a signif- 
icant detection of the BAO. 

Galaxy selections beyond z = 0.6 were already performed 
by surveys such as th e VIMOS-VLT Deep Surve3p] (VVDS, see 
Le Fevre et al.|2005a| ), DEEP20(see Davis et al. 2003 ) or Vimos 
Public Extragalactic Redshift Surve}]^ (VIPERS, see Guzzo et al. 
2012, in preparation), but they are not tuned for a BAO analysis. 
The DEEP2 Survey selected galaxies using BRI photometry in the 
redshift range 0.75-1 .4 on a few square degrees with a redshift suc- 
cess of 75 percent using the Keck Observatory. It studied the evolu- 
tion of properties of galaxies and the evolution of the clustering of 
galaxies compared to samples at low-redshift. In particular, insights 
in galaxy clustering to z = 1 brings a strong knowledge about the 
bias of these galaxies ( [Coil et aL]|2008| ). The VVDS wide survey 



observed 20 000 redshift on 4 deg^ limited to Iab < 22.5 (iG arillil 
|et al.p OOS ); they studied the properties of the galaxy population 
to redshift 1.2 and the small scale clustering around z - \. The 
VIPERS survey maps the large scale distribution of 100 000 galax- 
ies on 24 deg^ in the redshift range 0.5 - 1.2 to study mainly clus- 
tering and redshift space distortions. Their colour selection, based 
on ugri bands, is described in more detail in the section [6] 



3 COLOR SELECTIONS 

Our aim is to explore different colour selections that focus on galax- 
ies located in 0.6 < z < 1.7 with strong emission lines, so that as- 
signing redshifts to these galaxies is feasible within short exposure 
times (typically one hour of integration on the 2.5m SDSS tele- 
scope). The methodology used here has been first explored and ex- 
perimented by Davis et al. ( 2003 ), Adelberger et al. 2004 Drinkwa-j 
|ter et al.||2010] j Adelberger et al. ( 2004) derived different colour 
selections for faint galaxies (with 23 < R < 25.5) at redshifts 
1 < z < 3 based on the Great Observatories Origins Deep Survey 
data (GOODS, see Dickin son et al.|2003| l. |Drinkwater etaL| ( [20T0l ) 
selected ELGs using UV photometry from the Medium Imaging 
Survey of the GAlaxy EVolution EXplorer (MIS-GALEX, see|Mar] 
[tin et al.|20b 5) data combined with SDSS, to obtain a final density 
of 238 ELGs per square degree with 0.2 < z < 0.8 over ~ 800 
square degrees. 

Our motivation is to probe much wider surveys than GOODS 
or GALEX (ultimately a few thousands square degrees) and to 
concentrate on intrinsically more luminous galaxies (typically with 
g < 23.5) with a redshift distribution extended to redshift 1.7. 

The selection criteria studied in this work are designed for a 
ground-based survey and more specifically for the SDSS telescope, 
a 2.5m telescope located at Apache Point Observatory (New Mex- 
ico, USA), which has a unique wide field of view to carry out LSS 
studies ( |Gunn et al.|2006| t. The current BOSS spectrographs cover 
a wavelength range of 3600 - 10200A. Its spectral resolution, de- 
fined by the wavelength divided by the resolution ele ment, varies 
from ~ 1, 600 at 3, 600A to ~ 3, 000 at 10, OOOA JEisenstein 



|et al.|20lT] ). The highest redshift detectable with the [On] emission 
line doublet (^3727,^3729) is thus Zmax = 1-7. 

To select ELGs in the redshift range [0.6, 1.7] we have ex- 
plored two diff'erent selection schemes: first using w,g,r photometry 
and secondly using g,r,i photometry. 



3.1 Photometric data properties: SDSS, CFHT-LS and 
COSMOS 

The photometric SDSS survey, delivered under the data release 8 
(DR8, |Aihara et al.|201l| , covers 14,555 square degrees in the 5 
photometric bands u, g, r, i, z. It is the largest volume multi-color 
extragalactic photometric survey available today. The 3cr magni- 
tude depths are: u - 22.0, g - 22.2, r - 22.2, / = 21.3; see 
Fukugita et al.| ( |1996| ) for the description of the filters and Gunn| 
et al.| ( |1998| ) br the characteristics of the camera. The masnitudes 
we use are corrected from galactic extinction. 

The Canada France Hawaii Telescope Legacy Surve}]^ (here- 
after CFHTLS) covers ~ 155 deg^ in the u,g,r,i,z bands. The trans- 
mission curves of the filters differ slightl3Qfrom SDSS. The data 



^ http://cesam.oamp.fr/vvdsproject/ 
^ http ://deep . berkeley. edu/index .html 
^ http://vipers .inaf.it/proj ect.html 



^ http://www.cfht.hawaii.edu/Science/CFHLS/ 

^ http ://cadc w w w. dao . nrc . c a/megapipe/docs/ filters .html 
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Figure 3. Color-color diagrams tinted according to CFHT-LS photometric redshift. Colour selections are indicated with the dashed boxes. Quasars, when 
overlapping with the colour selection, are located in the pink ellipse. The stellar sequence appears in black, a) The bright ugr colour selection: u-g vs g-r for 
20 < g < 22.5. In this area the overall density of targets is low. b) The faint ugr colour selection where the magnitude range is 22.5 < g < 23.5. The density 
of available targets increases and the stellar sequence is diminished, c) The bright gri colour selection: r-i vs g-r diagram for 19 < / < 21.3. The high-redshift 
targets are not as visible as in the ugr plot. The target density is also small, d) The faint gri colour selection where the magnitude range is 21.3 < / < 23. The 
target density increases and higher redshift targets appear in the blue end of the colour plot while the stellar locus fades. Using SDSS photometry one would 
obtain similar plots to a) and c) in terms of target density. 
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Figure 2. The four bands ugri and their precision are illustrated; in red for 
SDSS photometry; in blue for CFHTLS photometry. The u band quality 
is limiting the precision of the colour selection on SDSS photometry. Note 
that the photometric redshift CFHTLS catalog is cut at i = 24, and the SDSS 
data is R-selected with errR < 0.2. 



and cataloging methods are described in the T0006 release docu- 
men{^ The 3cr magnitude depths are: u = 25.3, g = 25.5, r = 24.8, 
/ = 24.5. The CFHT-LS photometry is ten times (in r and /) to 
thirty times (in u) deeper than SDSS DR8, however the CFHTLS 
covers a much smaller field of view than SDSS DR8. The magni- 
tudes we use are corrected from galactic extinction. The CFHT-LS 
photometric redshift catalogs are presented in Ilbert et al.| ( |2006| l, 
and jCoupon et al.| ( [2009| ) ; the photometric redshift accuracy is esti- 
mated to be cr^ < 0.04(1 +z) for g < 22.5. This photometric redshift 
catalog is cut at / = 24, beyond which photometric redshifts are 
highly unreliable. Fig. |2] displays the relative depth between SDSS 
and CFHT-LS wide surveys in the u, g,r, i-hmids. 

COSMOS is a deep 2 deg^ survey that has been observed 
at more than 30 diff'erent wavelengths ( Scoville et al. 2007 ). The 
COSMOS photometric catalog is described in Capak et al. ( 2607] ) 
and the photometric redshifts in [Ilbert et al.| p009| ). The COS- 
MOS Mock Catalog, (hereafter CMC; se^ is a simulated spectro- 
photometric catalog based on the COSMOS photometric catalog 
and its photometric redshift catalog. The magnitudes of an object 
in any filter can be computed using the photometric redshift best- 
fit spectral templates ( [Jouvel et al.||2009| Zoubian et al. 2012, in 
preparation). 

The limiting magnitudes of the CMC in the each band are the 
same as in the real COSMOS catalog (detection at 5cr in a 3" diam- 
eter aperture): u < 26.4, g < 27, r < 26.8, / < 26.2. For magnitudes 



^ http://terapix.iap.fr/cplt/T0006-doc.pdf 

^ http://lamwws.oamp.fr/cosmowiki/RealisticSpectroPhotCat 
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in the range 14 < m < 26 in the g,r,i bands from the Subaru tele- 
scope and in the u band from CFHTLS, the CMC contains about 
280,000 galaxies in 2 deg^ to COSMOS depth. The mock catalog 
also contains a simulated spectrum for each galaxy. These simu- 
lated spectra are generated with the templates used to fit COSMOS 
photometric redshifts. Emission lines are empirically added using 
Kennicutt calibration laws ( |Kennicutt| 1998 , Ilbert et al. 2009 ), and 
have been calibrated using zCOSMOS (Li lly et al.| |2009) as de- 
scribed in Zoubian et al. 2012, in preparation. The strength of [On] 
emission lines was confirmed using DEEP2 and VVDS DEEP lu- 
minosity functions ( |Le Fevre et al.|2005b|rZhu et al.|20"09l ). Finally 
a host galaxy extinction law is applied to each spectrum. Predicted 
observed magnitudes take into account the presence of emission 
lines. 



3.2 Color selections 

Based on the COSMOS and CFHT-LS photometric redshifts, we 
explore two simple colour selection functions using the ugr and gri 
bands. Fig. [3] shows the targets available in the ugr and gri colour 
planes. We construct a bright and a faint sample based on the pho- 
tometric depths of SDSS and CFHT-LS. 

3.2.1 ugr selection 

The ugr colour selection is defined by -1 < u-r < 0.5 and -1 < g- 
r < 1 that selects galaxies at z ^ 0.6 and ensures that these galaxies 
are strongly star-forming {u-r cut). The cut -1 < u - g < 0.5 
removes all low-redshift galaxies {z < 0.3). Finally the magnitude 
range is 20 < g < 22.5 and g <23.5 for the bright and the faint 
samples, resp. Fig.|3]a) and b). 

3.2.2 gri selection 

The bright gri colour selection is defined by the range 19 < / < 
21.3. We select blue galaxies at z ~ 0.8 with 0.8 < r - / < 1.4 and 
-0.2 < g - r < 1.1 (Fig.[3]c). In the faint range 21.3 < / < 23, we 
tilt the selection to select higher redshifts with -0.4 < g - r < 0.4, 
-0.2 < r - / < 1.2 and g - r < r - / (Fig.|3]d). 

3.3 Predicted properties of the selected samples 

The ugr colour selection avoids the stellar sequence, but not the 
quasar sequence. Hence, the contamination of the ugr selection 
by point-source objects is primarily due to quasars; see Fig. [3] a) 
and b). The resulting photometric-redshift distribution as derived 
from the CFHT-LS photometric redshift catalog has a wide span 
in redshift, covering 0.6 < z < 2 as shown in Fig. |4] The distri- 
bution is centered at z = 1.3 for the bright and the faint sample 
with a scatter of 0.3 (see Table [2|. The expected [On] fluxes are 
computed from the CMC catalog and are shown in Fig. |4] For 90 
percent of galaxies in the faint sample, the predicted flux is above 
10.6 X 10~^^ergcm~^ s~^ The bright sample galaxies show strong 
emission lines. 

The gri selection avoids both the stellar sequence and the 
quasar sequence; see Fig.|3]c) and d). Thus the contamination from 
point-sources should be minimal. Fig.|4]shows the photometric red- 
shift distribution of the gri selection applied to CFHT photometry. 
The redshifts are centered at z = 0.8 for the bright and 1.0 for the 
faint sample (see Table [2]). The expected [On] flux, computed with 



the CMC catalog, is shown in Fig.|4] Emissions are weaker than for 
the ugr selection as expected. 

The diff'erent selections shown in Fig.|3]and Fig|4]are summa- 
rized in Table [2] which contains the number densities available, 
mean magnitudes, mean redshifts, and mean [On] fluxes (when 
available) of the diff'erent samples considered. We have lower den- 
sities in the CMC than in the CFHT-LS catalog. This is probably 
due to cosmic variance as the CMC only covers 2 deg^. 

The SDSS colour- selected samples are complete for the bright 
samples at g < 22.5 and / < 21.3, not for the faint samples. The 
CFHTLS-selected samples are complete for both the bright and 
faint samples; see Fig[5] where the total cumulative number counts 
(solid line) of the ugr and gri colour- selected samples are plotted 
as a function of g and / bands respectively. On the bright end of 
this Figure, although both photometry are complete at the bright 
limit, we note a discrepancy between the total amount of target se- 
lected on CFHT and SDSS that implies selections on CFHT are 
denser than on SDSS (diff'erence between the red and blue solid 
lines). This is due to the transposition of the color selection from 
one photometric system to the other. In fact, we select targets on 
SDSS with a transposed criterion from CFHT using the calibra- 
tions by |Regnault et al.| ( [200 9). The transposed criterion is as tight 
as the original. But as the errors on the magnitude are larger in the 
SDSS system, their colour distributions are more spread. Therefore 
the SDSS selection is a little less dense than the CFHT selection. 

Targeting the bright range is limited by galaxy density, in the 
best case one can reach 300 targets deg~^ and it contains point- 
sources (stars and quasars) and low-redshift galaxies. In the faint 
range, the target density is ten times greater, but the exposure time 
necessary to assign a reliable redshift will be much longer (one 
magnitude deeper for a continuum-based redshift roughly corre- 
sponds to an exposure five times longer). The stellar, quasars and 
low-redshift contamination is smaller in the faint range. Fig. |4] 
shows the distributions in redshift and in [On] flux we expect given 
a magnitude range and a colour criterion within the framework of 
the CMC simulation. The main trend is that the ugr selection identi- 
fies strong [On] emitters out to z ~ 2 where the gri peaks at redshift 
1 and extends to 1.4 with weaker [On] emitters. 

We also used a criterion to split targets in terms of compact 
and extended sources, which is illustrated in Fig. [5] For CFHT-LS 
we have used the half-light radius (r2 value, to be compared to the 
^hmit y^i^g which defines the maximal size of the PSF at the loca- 
tion of the object considered - see Coupon et al 2009 and CFHT-LS 
T0006 release document) to divide the sample into compact and 
extended objects. For SDSS we used the "type" flag, which sep- 
arates compact (type=6) from extended objects (type=3). For the 
ugr colour selection, the number counts are dominated by compact 
blue objects (quasars) at g < 22.2. At g ^ 22.2 the counts are dom- 
inated by extended ELGs. For comparison we show in Fig. |5] the 
cumulative counts of the XDQSO catalog from Bovy et al. ( 2011 ) 
who identified quasars in the SDSS limited to g < 21.5. We notice 
an excellent match with the bright (compact) ugr colour- selected 
objects. For the gri colour selection, there is a low contamination 
by compact objects because the colour box does not overlap with 
either the stellar or the quasar sequence. 



4 ELG OBSERVATIONS 

To test the reliability of both the bright ugr (g < 22.5) and the 
bright gri (i < 21.3) colour selections, we have conducted a set 
of dedicated observations, as part of the "Emission Line Galaxy 



Investigating Emission Line Galaxy Surveys with the Sloan Digital Sky Survey Infrastructure 7 




1.0 

redshift 




10-16 

Flux [On] 



Figure 4. Redshift and line flux distribution for various color selections. The blue solid line is the ugr selection limited to 20 < g < 22.5. The blue dashed line 
is the ugr selection limited to 22.5 < g < 23.5. The red solid line is the gri selection limited to 19 < / < 21.3. The red dashed line is the gri selection limited to 
21.3 </ < 23.0. Left. Photometric redshift distribution from the CFHT-LS catalog. Black solid line: no colour selection, but limited to i < 24. The ugr selection 
is more spread in redshift than the gri selection. The vertical dashed line indicates the upper limit of z = 1.7 that corresponds to [On] emission lines at ~ l//m. 
Right. Expected [On] flux distribution from the CMC simulation. The black solid line is the complete CMC catalog. The ugr selection identifies stronger line 
emitters than the gri selection scheme. The vertical dashed line indicates the expected mean sensitivity of BOSS in Ih exposure, 5 x 10~^^erg s~^ cm~^ . 




Figure 5. Cumulative number counts per square degree of both colour selections using SDSS photometry (in red) and CFHTLS photometry (in blue). The 
solid line is the whole selection, while the dashed line is for compact (star-like) objects and the dotted line represents extended objects. On the left is the ugr 
selection and on the right is the gri selection. The black dashed line represents objects from the XDQSO catalog that have a probability of being a quasar that 
is greater than 90 percent (Bovy et al.,2011j (the quasar identifications are limited to g = 21.5). 
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Table 2. Properties of the colour-selected samples, 'b' stands for bright (20 < g < 22.5 for ugr or 19 < / < 21.3 for gri) and 'f for faint (22.5 < g < 23.5 
for ugr or 21.3 < / < 23 for gri). We indicate the mean and standard deviation of the redshift distribution for each set. For the CMC sample, [On] fluxes are 
expressed in units of 10~^ '^erg cm~^ s~^ ; the median, the first and third quartiles of the flux distribution are indicated. The ugr tends to select the highest [On] 
fluxes. 



selection 




#deg-2 


u 


g 


r 


i 


z 




/[Oil] 


^/[OII] 






ugr 


b 


130.0 


21.98 


21.87 


21.69 




1.25 


0.53 


61.74 


46.47 


88.39 


CMC 


f 


1450.8 


23.27 


23.18 


22.98 




1.19 


0.38 


16.60 


13.06 


22.26 


gri 


b 


257.2 




22.69 


21.87 


20.93 


0.80 


0.21 


13.85 


8.65 


22.21 




f 


2170.5 




23.34 


23.09 


22.55 


0.93 


0.31 


10.23 


6.83 


15.99 




ugr 


b 


193.3 


21.95 


21.8 


21.7 




1.28 


0.38 








CFHT-Wl 


f 
b 


1766.8 
361.4 


23.37 


23.19 
22.62 


23.07 
21.8 


20.82 


1.29 
0.81 


0.31 
0.11 










gri 


f 


3317.5 




23.34 


23.11 


22.55 


1.03 


0.35 










ugr 


b 


232.2 


21.89 


21.76 


21.69 




1.27 


0.37 








CFHT-W3 


f 
b 


1679.1 
391.6 


23.36 


23.18 
22.62 


23.06 
21.78 


20.8 


1.28 
0.82 


0.31 
0.1 










gri 


f 


3334.2 




23.34 


23.11 


22.54 


1.03 


0.33 








SDSS 


ugr 


b 


166.96 


21.76 


21.77 


21.52 














gri 


b 


204.96 




22.57 


21.75 


20.76 













SDSS-III/BOSS ancillary program". The observations were con- 
ducted between Autumn 2010 and Spring 2011 using the SDSS 
telescope with the BOSS spectrograph at Apache Point Observa- 
tory. A total of ~2,000 spectra, observed 4 times 15 minutes, were 
taken in different fields: namely, in the Stripe 82 (using single epoch 
SDSS photometry for colour selection) and in the CFHT-LS Wl, 
W3 and W4 wide fields (using CFHT-LS photometry). This data 
set was released in the SDSS-III Data Release ^3 



4.1 Description of SDSS-III/BOSS spectra 

We used the SDSS photometric catalog ( [Aihara et al.|201ll ) to se- 
lect 313 objects according to their ugr colours located in the Stripe- 
82 and 899 objects selected according to their gri colours in the 
CFHT-LS W3 field. In addition we used the CFHT-LS photome- 
try to select 878 ugr targets in the CFHT-LS Wl field, and 391 gri 
targets in the CFHT-LS W3 field for observation. The spectra are 
available in SDSS Data Release 9 and flagged 'ELG' . 

All of these spectra were manually inspected to confirm or 
correct the redshifts produced by two diflferent pipelines (zCode 
and its modified version that we used to fit the [On] emission line 
doublet). As the BOSS pipeline redshift measurement is designed 
to fit LRG continuum some ELG with no continuum were assigned 
wrong redshifts. To classify the observed objects, we have defined 
seven sub-categories : 



Objects with secure redshifts 

• 'ELG', Emission-line galaxy (redshift determined with mul- 
tiple emission lines). Usually these spectra have a weak 'blue' 
continuum and lack a 'red' continuum. Empirically, using plat- 
EFiT viMos pipeline output, this class corresponds to a spectrum 
with more than two emission lines with observed equivalent widths 
< -6A ; see examples in Appendix [a| 



http://dr9.sdss3.org/ 



• 'RG', Red Galaxy with continuum in the red part of its spec- 
trum, allowing a secure redshift measurement through multiple ab- 
sorption lines ( e.g. Ca K&H, B aimer lines) and the 4000A break. 
Some of these objects have also weak emission (E+A galaxies). 
Empirically their spectra have a mean D;^(4000) of 1.3 ; where 
Z)„(4000) is the ratio of the continuum level after the break and 
before the break. These galaxies typically have / ~ 20, which is 
fainter than the CMASS targeted by BOSS. 

• 'QSO', Quasars, which are identified through multiple broad 
lines. Examples are given in Fig. |A4| 

• Stars. 

Objects with unreliable redshifts 

• 'Single emission line' : the spectra contain only a single emis- 
sion line which cannot allow a unique redshift determination. For 
this population, the CFHT T0006 photometric redshifts are com- 
pared to the [On] redshift (assuming the single emission line is 
[On]) in Fig. [6] The two estimates agree very well : 77.7 percent 
have {Zspec - Zphot)/(^ + Zspec) < 0.1 for the gri selection and 62.7 
percent for the ugr selection. These galaxies with uncertain redshift 
tend to have slightly fainter magnitudes with a mean CFHT g mag- 
nitude at 22.6 and a scatter of 0.6, whereas for the whole ELGs is 
22.4 with a scatter of 0.4. 

• 'Low continuum' spectra that show a 4000A break too weak 
for a secure redshift estimate. The agreement between photomet- 
ric and spectroscopic redshift estimation is excellent : 84.6 percent 
within 10 percent errors; see Fig. [6] 

• 'Bad data', the spectrum is either featureless, extremely noisy 
or both. 

The detailed physical properties of the ELGs are discussed in sec- 
tion |5] and a number of representative spectra are displayed in Ap- 
pendix |A| 

4.2 Redshift Identification 

The results of the observations are summarized by categories in 
TableEl 
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Figure 6. T0006 CFHT-LS photometric redshifts of single emission line 
and low continuum galaxies observed against [On] redshift. A strong cor- 
relation is clearly evident. A slight systematic over-estimation of the pho- 
tometric redshift is visible above redshift 1 .2 (these photometric redshifts 
were calibrated below 1.2). 




0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 
redshift 

Figure 7. Observed redshift distribution for the ugr ELGs (blue), the gri 
ELGs (black) compared to the distribution of galaxies from BOSS (red) 
and WiggleZ (green). Magenta lines represent constant density of galaxies 
at 1 and 3 xlO~^ Mpc~^, it constitutes our density goals. 



For the targets selected using SDSS photometry and with the 
ugr selection : 32 percent are ELGs at a redshift z > 0.6 (100 spec- 
tra). The low-redshift ELGs represent 32 percent of the observed 
targets (101 spectra). The other categories are : 65 'bad data' (20 
percent), 30 quasars (10 percent), 10 stars (3.5 percent), and 7 red 
galaxies with z < 0.6 (2.5 percent). With the gn-selection, 57 per- 
cent of the targets are at z > 0.6. However, still 21 percent of the 
spectra fall into the bad data class. 

Using CFHTLS photometry 46 percent of targets are ELGs at 
z > 0.6 and 14 percent are quasars with the ugr selection. With 
the gn-selection, 73 percent are galaxies at z > 0.6, five-sixths of 
which are ELG. 

For both selections, targeting with CFHTLS is more efficient 
than with SDSS. The complete classification of observed targets 
is in Table [3] The redshift distribution of the observed objects is 
compared to the distributions from the BOSS and WiggleZ current 
BAO experiments in Fig. [7] The Figure shows that ugr and gri tar- 
get selections enable a BAO study at higher redshifts. With a joint 
selection, we can reach the requirements described in Table [T] to 
detect BAO feature to redshift 1 . 



4.3 Comparison of measured ELGs with the CMC forecasts 

To investigate the expected purity of ELG galaxies samples, we cre- 
ated mock catalogs covering redshifts between 0.6 and 1.7. Con- 
tinuum spectra of ELGs were generated from the Cosmos Mock 
Catalog and emission lines were added according to the modeling 
described in |Jouvel et al.| ( [2009] ). Two simulated galaxy catalogs 
were built, one for each colour selection function (ugr and gri). 
Each synthetic spectrum was aff'ected by sky and photon noise as if 
observed by BOSS spectrographs, by using the specsimId software. 



We simulated a set of four exposures of 900 seconds each. The re- 
sulting simulated spectra were then analyzed by the zCode pipeline 
( [Cannon et al.|2006| to extract the spectroscopic redshift. As our 
targets are mainly emission line galaxies, we only use the redshift 
estimate based on fitting discrete emission line templates in Fourier 
space over all z. 

We address the flux measurement of emission lines. This exer- 
cise was conducted using the Platefit Vimos software developed by 
jLamareille et al.| (>2009). This software is based on the platefit soft- 
ware that was developed to analyze SDSS spectra ( jTremonti et al.j 
|2004a|[Brin'chmann et al.|2004| ). The PLATEFIT VIMOS softwarc was 
developed to measure the flux of all emission lines after removing 
the stellar continuum and absorption lines from lower resolution 
and lower signal-to-noise ratio spectra ( Lamareille et al. 2006 ). The 
stellar component of each spectrum is fit by a non-negative linear 
combination of 30 single stellar population templates with differ- 
ent ages (0.005, 0.025, 0.10, 0.29, 0.64, 0.90, 1.4, 2.5, 5 and 11 
Gyr) and metallicities (0.2, 1 and 2.5 Z©). These templates have 
been derived using the |Bruzual & Ch ariot ( 2003 ) libraries and have 
been resampled to the velocity dispersion of VVDS spectra. The 
dust attenuation in the stellar population model is left as a free pa- 
rameter. Foreground dust attenuation from the Milky Way has been 
corrected using the Schle gel et al.| ( [T99 8 ) maps. 

After removal of the stellar component, the emission lines are 
fit as a single nebular spectrum made of a sum of Gaussians at spec- 
ified wavelengths. All emission lines are set to have the same width, 
with the exception of the [On] /13727 line which is a doublet of 
two lines at 3726 and 3729 A that appear broadened compared to 
other single lines. Detected emission lines may also be removed 
from the original spectrum in order to obtain the observed stellar 
spectrum and measure indices, as well as emission-line equivalent 
widths. The underlying continuum is obtained by smoothing the 
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Table 3. Observed objects split in categories. 







gri 


selection 






ugr 


selection 






SDSS selection 


CFHTLS selection 


SDSS selection 


CFHTLS selection 


Type 


Number 


% 


Number 


% 


Number 


% 


Number 


% 


ELGU > 0.6) 


450 


50 


240 


61 


100 


32 


402 


46 


ELG(z < 0.6) 


60 


7 


3 


1 


101 


32 


84 


9 


RG(z > 0.6) 


73 


8 


46 


12 














RG(z < 0.6) 


30 


3 








7 


3 








single emission line 


36 


4 


12 


3 








102 


12 


low continuum 


13 


1 


1 

















QSO 


8 


1 


5 


1 


30 


10 


126 


14 


stars 


44 


5 


12 


3 


10 


3 


6 


1 


bad data 


185 


21 


72 


18 


65 


20 


158 


18 


total 


899 


100 


391 


100 


313 


100 


878 


100 




Redshift, Wavelength/3728 A 
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Figure 8. Top : Flux detected in the [On] doublet (green or red filled circle for a simulated spectrum and blue filled square for a real galaxy observation) 
versus redshift. The simulation is coloured according to the measured line flux divided by the error. The black solid line represents a typical sky as observed 
by BOSS spectrograph; the 5cr detections of [On] follow the sky flux. The [On] detections above 5cr (red circles) follow the tendency of the sky fluctuations 
: a 5cr detection in a zone with a high sky requires a larger emitted flux. The data are scaled to match a 1 hour exposure on the 2.5m SDSS telescope. The 
redshifts below the sky flux are assigned using other emission lines or the information in the continuum. Bottom: : An expansion of the top panel using z > 1 
detections. Only measured galaxies are plotted upon the mean sky (black) and the best sky (red) of APO. 
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stellar spectrum. Equivalent widths are then measured via direct in- 
tegration over a 5cr bandpass of the emission-line Gaussian model 
divided by the underlying continuum. Then emission lines fluxes 
are measured for each simulated spectra using the extracted red- 
shift from zCoDE and the true redshift for cross-checks. 

We consider that a redshift has been successfully measured if 
Az/(1 + z) < 0.001. We beheve that this threshold could be low- 
ered to 10~^ in the future by using a more advanced redshift solver. 
Using the current pipeline, we can distinguish these two regimes. 

The first regime is the redshift range z < 1.0. Many emission 
lines ([Oil], H/?, [OIII]) are present in the SDSS spectrum. For g < 
23.5, 91 percent of the redshift are measured sucessfully. Among 
the remaining 9 percent, catastrophic failures represent 3.5 percent 
(the pipeline outputs a redshift between and 1.6 with Az/(1 + 
z) > 0.01). Inaccurate redshifts represent 3.9 percent (the pipeline 
outputs a redshift between and 1.6 with 0.001 < Az/(l+z) < 0.01) 
and 1.5 percent are not found by the pipeline (z = -9 is output). 

The second regime is the redshift range 1.0 < z < 1.7: the red- 
shift determination hinges on the identification of the [On] doublet. 
For g < 23.5, 66.8 percent of the redshifts are measured sucessfully. 
19.1 percent are catastrophic failures and 14.1 percent are inaccu- 
rate redshifts. Work is ongoing to improve the redshift measure- 
ment efficiency at z > 1 . In the second regime, the minimum [On] 
flux required to compute a reliable redshift depends on the redshift 
/ wavelength, because of the strong OH sky lines in the spectrum. 
We infer from the observed spectra that to measure a reliable red- 
shift, we require a 5cr detection of the [On] lines, which means a 
(blended or not) detection of two peaks in the emission line sepa- 
rated by 2(1 + z). The detection significance is defined from the Id 
spectrum. From the data the faintest 5cr detections are made with a 
flux of 4 X 10~^^erg s~^ cm~^ and the brightest 5cr detections need a 
flux of 2 X 10~^^erg s~^ cm~^ to be on top of sky lines. The simula- 
tion shows the same thresholds; see Fig. [8] The simulation confirms 
the detection limit we observe. The bottom plot of Fig.[8]raises the 
issue that the sky time variation has a non-negligible impact on the 
detection limit of the [On] emission doublet for redshifts z > 1.1. 
Though this ELG sample is too small to address this issue. In fact 
the sample was observed during ten diff'erent nights and the number 
of ELG with z > 1.1 is less than 60. It is thus not possible to de- 
rive a robust trend comparing the [On] detections to the sky value 
of each observation. Handling this issue would require a sample of 
~500 redshifts in 1.1 < z < 1.6 observed many times over many 
nights. With such a sample in hands, we could quantify exactly how 
to optimize the observational strategy. 



5 PHYSICAL PROPERTIES OF ELGS 

All ugr and gri ELG spectra were analyzed with two diff'erent soft- 
ware packages: the PlateFit VIMOS ( Lamareille et al.,2009^ and 
the Portsmouth Spectroscopic Pipeline ( Thomas et al.||2012| . In 
this section we discuss the following physical properties of the ob- 
served ELGs: redshift, star forming rate (SFR), stellar mass, metal- 
licity and classification of the ELG type (Seyfert 2, LINERs, SFG, 
composite). Observations a larger samples of ELGs are planned to 
estimate how these quantities vary over time and with their envi- 
ronment, and also to estimate how the clustering depends on these 
physical quantities. It is key to replace future BAO tracers in the 
galaxy formation history. With the current sample, we draw simple 
trends using means and standard deviation of the observed quanti- 
ties, and we place the ELGs in the galaxy classification made by 
|Lamareille| ( [joTOl ); |Marocco et al.| ( [20n] ). 



5.1 Main Properties 

The main properties of the ELGs are shown in the Tab le [4| The 
star forming rate was computed using the equation 18 of |Argenc"e| 
|& Lamareille| ( |2009| ). The stellar mass was estimated using the 
CFHTLS ugriz photometry. (The errors on the stellar mass using 
only SDSS photometry were too large to be meaningful, thus the 
empty cells in the table). The metallicity is estimated using the cal- 
ibration by |Tremonti et al.| ( [2004b| ). The main trends are : 

• The gn-selected galaxies of CFHTLS are the more massive 
galaxies in terms of stellar mass. 

• The ugr selects stronger star-forming galaxies than gri (due 
to the w-band selection). There is a factor of two variations in the 
strength of the measured oxygen lines. 

• The ugr selects galaxies that have 12 + \og[OH] e [8,9] 
whereas gri focuses slightly more on the higher 12 + log [OH] ^ 9. 

• the SFR appears to be independent of the color selection 
schemes. 



5.2 Classification. 



We use a recent classification ( [Lamareille] |2010| |Marocco| 
|et al.| |2011| l for the ELG sample. The classification is 
made" using log([Oiii] ///^), log([Oii] ///^), D„(4000), and 
log(max(£W[oii],£^W[Neiii])). We compare the ELG sample to 
zCOSMOS, as zCOSMOS has numerous star forming galaxies 
in the redshift range we are observing. Fig. [9] a) shows that the 
zCOSMOS and the ugr ELG samples are located in three of the 
five areas delimited by the classification: Seyfert 2 ('Sy2'), Star 
Forming Galaxies ('SFG'), and a third region where both mix 
('Sy2/SFG'). There are a few LINERs and Composite in either 
sample. Fig.[9]b) separates the ugr galaxies in the 'Sy2/SFG' area 
into 'SFG' or 'Sy2', and shows that zCOSMOS galaxies from the 
'Sy2/SFG' area are both 'Sy2' and 'SFG' where the ugr ELGs in 
the 'Sy2/SFG' area are mostly 'SFG'. The gri observed sample is 
located in the area of Star Forming Galaxies ('SFG'), whether one 
considers the one selected on CFHT or on SDSS. Finally, the ELG 
selected, ugr or gri, are both in the 'SFG' part of the classification. 



6 DISCUSSION 

6.1 Redshift identification rates in ugr and gri 

We summarize the redshift measurement efficiency of the gri and 
ugr colour- selected galaxies presented in this paper in Tables [3] and 
5) and we compare the results with those of WiggleZ fD rinkwater| 
etari[20T0 l, BOSS and VIPERS (the percentages about VIPERS 
are based on a preliminary subset including only ~ 20 percent of 
the survey). The original VIPERS selection ffag (J. Coupon and O. 
Ilbert private communication) is defined to have colours compati- 
ble with an object at z > 0.5 if it has (r - / ^ 0.7 and w - g > 1.4) or 
(r-i ^ 0.5(u - g) and u-g < 1.4) (Guzzo et al. (2012), in prepara- 
tion). The efficiencies in the Table |5] show that a better photometry 
and thus more precise colours yield a better efficiency in terms of 
obtaining objects in the targeted redshift range. It also shows the 
colour selections proposed in this paper are competitive for build- 
ing an LSS sample. 

To determine the necessary precision on the photometry to 
stay at the efficiencies observed, we degrade the photometry of the 
observed ELGs, then reselect them and recompute the efficiencies. 
Using a photometry less precise than the CFHTLS by a factor of 2.5 
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Table 4. Main properties of the observed galaxies. Fluxes are in unit of 10~^^ erg cm~^ s~^ . Equivalent Widths 'EW are in A. 



gn-selected ELG w^r-selected ELG 

CFHTLS SDSS CFHTLS SDSS 





mean 


cr 


mean 


cr 


mean 


cr 


mean 


cr 


EW[on] 


-14.86 


9.01 


-16.75 


10.13 


-50.58 


27.24 


-30.75 


23.04 


Flux[oii] 


16.85 


9.65 


18.58 


10.37 


30.36 


30.1 


24.23 


39.27 




-10.28 


10.8 


-10.72 


8.65 


-24.27 


22.88 


-17.18 


19.34 


Flux//^ 


15.44 


8.6 


14.63 


7.72 


12.97 


15.16 


12.57 


23.91 


EW[oin] 


-10.09 


10.98 


-11.33 


10.76 


-65.3 


91.56 


-16.89 


30.49 


Flux[oni] 


17.74 


20.15 


17.43 


21.59 


35.13 


53.49 


13.39 


37.79 


12 + log OH 


8.94 


0.20 


8.92 


0.19 


8.69 


0.21 


8.69 


0.25 


logSFR[on] 


0.97 


0.35 


0.92 


0.45 


0.96 


1.24 


0.76 


0.84 


log(M7Mo) 


10.85 


0.3 


10.23 


6.87 


9.33 


0.80 








Figure 9. Black dots stands for CFHTLS selected ELGs, red dots for SDSS selected ELGs and grey contours or grey pixels for zCOSMOS survey 
galaxies, a) log([Oin] I Hp) vs. log([Oii] ///^) for ugr ELGs. The ugr ELG sample is located in a similar area than zCOSMOS galaxies, b) Z);^(4000) vs. 
log(mflx(£^W[oii], £^W[Neni])) for ugr ELGs. Only galaxies from the 'Sy2/SFG' area from the plot a) are represented. The ugr ELG is thus mainly composed 
of 'SFG'. c) log([Oiii] ///^) vs. log([On] ///^) for gri ELGs. The gri ELG sample is located in the 'SFG' area. 



in the errors (the ratio of the median values of the mag errors in bins 
of 0.1 in magnitude equals 2.5) does not significantly change nei- 
ther the efficiency nor the redshift distribution implied by the colour 
selection. This change also corresponds to loosening the colour cri- 
terion by 0.1 mag. For the eBOSS survey a photometry 2.5 times 
less precise than CFHTLS should be sufficient to maintain a high 
targeting efficiency (for comparison, SDSS is 10 times less precise 
than CFHTLS); Fig. shows the smearing of the galaxy positions 
in the colour-colour plane for a degraded photometry. 

6.2 Measurement of the [On] doublet, single emission line 
spectra 

For ground-based spectroscopic surveys observing ELGs with 1 < 
z < 1.7, the only emission line remaining in the spectrum to assign 
the spectroscopic redshift is the [On] doublet. For the redshift to 
be certain the doublet must be split (i.e., we do not want the target 
to be classified as 'single emission line' ELG). Fig. [TT] shows a 
subsample of the observed bright ugr ELGs where [On] doublets 
are well resolved. 

We can circumvent the 'single emission line' ELG issue (Fig. 
|6]) by increasing the resolution of the spectrograph. This modifica- 
tion will enhance a better split of [On], and will increase the room 



Table 5. Redshift efficiency in percent. The second column 'spectroscopic 
redshift' quantifies the amount of spectroscopic redshift obtained with the 
selection. The third column 'object in z window' is the number of spectro- 
scopic redshift that are in the range the survey is aiming at, it is the efficiency 
of the target selection. The redshift window for ELG selection is z > 0.6. 



selection 


spectroscopic 


object in 


quasars 


scheme 


redshift 


z window 




gri SDSS 


80 


62 


1 


gri CFHTLS 


82 


73 


1 


ugr SDSS 


80 


32 


10 


ugr CFHTLS 


78 


56 


13 


WiggleZ 


60 


35 




BOSS 


95 


95 




VIPERS 


80 


70 





available to observe the doublet by rendering sky lines 'thiner' . The 
sky acts as an observational window and prevents some narrow red- 
shift ranges to be sampled by the spectrograph; see Fig. [8] Increas- 
ing the resolution dilutes the signal, and thus the exposure time has 
to be increased to reconstruct properly the doublet above the mean 
sky level. 
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0.6 RA 37.93204 ; DEC -7.13792 

Z = 1.487 

g = 22.38 ; u-g = 0.02 
g-r = -0.12 ; r-i = 0.11 



0.4- 




wavelength [A] 

Figure 11. Observed Spectra zoomed on the [On] doublet, extracted from the ugr ELG sample. The doublet is enough split (apart from a Httle blending) to 
assign the correct spectroscopic redshift without the help of other emission lines. 



'single emission line' flux observed) contaminated by a noise of 
3 X 10~^^ergs~^ cm~^ (typical BOSS dark sky). The;^^ of the two 
fits are equal at low resolution and become disjoint in favor of the fit 
with 2 Gaussians for a resolution above 3000 at 7454.2A {i.e. [On] 
at redshift 1). Such an increase in resolution could help assigning 
proper redshifts to 'single emission line' ELGs. 

6.3 How / why redshift went wrong 

The main diff'erence in redshift measurement efficiency between 
SDSS and CFHT-LS colour selection is mainly due to the differ- 
ence in photometry depth. Using calibrations made by |Regnault| 
|et al.| ( |2009| , it is possible to translate the colour selection crite- 
ria from CFHT-LS magnitudes to SDSS magnitudes. The colour 
difference can be as large as 1 magnitude as the SDSS magnitude 
cut is close to the detection limit of the SDSS survey; see Fig. [12] 
where SDSS gri colour- selected galaxies are represented with their 
CFHTLS magnitudes. 



RA 38.26316 ; DEC -4.61358 
Z = 1.488 

g = 21.94 ; u-g = 0.22 
g-r = 0.04 ; r-i = 0.08 




G-I G-I 



Figure 10. U-R vs. G-I colored according to the photometric redshift. On 
the left CFHT-LS photometry, on the right CFHT-LS photometry degraded 
by a factor 2.5. This comparison shows how the degradation of the photom- 
etry smears the clean separations between galaxy populations in redshift. 



We performed a simulation of the [On] emission line fit to 
quantify by which amount the resolution must be increased to 
have no 'single emission line' ELGs. We fit one or two Gaus- 
sians on a doublet with a total flux of 10~^^ergs~^ cm~^ (lowest 



6.4 How to improve ELG selection for future surveys 

We suggest a few ways to increase the redshift measurement effi- 
ciency and reach the requirements set in the second section. 
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Figure 12. gri selection based on colours from SDSS (black box) repre- 
sented on CFHTLS magnitudes. The scatter is quite large: about half the 
targets would not have been selected if we used CFHTLS photometry. The 
'wanted' objects are galaxies at z > 0.6 or quasars and 'unwanted' objects 
are the rest. 



For the ugr selection : lowering u-g cut to 0.3 diminishes the 
contamination by low-redshifts galaxies. Additional low-redshift 
galaxies can be removed from the selection through an inspection 
of the images. Some of the low-redshift galaxies are quite extended, 
and one could mistake a high-redshift merger for an extended low- 
redshift galaxy. Visual inspection reduces the low-redshift share 
from 9 percent to 4 percent. The compact and extended selection 
on the CFHT data is very efficient at identifying quasars. There is 
also room for improving the spectroscopic redshift determination 
and thus re-classify 'single emission line' galaxies : they represent 
a 12 percent share, among which 10 percent are at z > 0.6. It seems 
reasonable to assume an efficiency improvement from 46 percent 
ELG(z > 0.6) + 14 percent quasar to 61 percent ELG(z > 0.6) + 
14 percent quasar. Thus a total efficiency of ~ 75% 

For the gri selection : improving the spectroscopic redshift 
determination pipeline can gain up to 5 percent efficiency thus in- 
creasing from 73 to 78 percent of ELG(z > 0.6). 

We have also optimized target selections for BAO sampling 
density using the four bands ugri. We find that the optimum selec- 
tions have a redshift distribution close to the smooth combination 
of the gri and ugr selections discussed here; see Fig. [13] 



7 CONCLUSION 

We present an efficient emission-line galaxy selection that can pro- 
vide a sample from which one can measure the BAO feature in the 
2-point correlation function at z > 0.6. With the photometry avail- 
able today we can plan for a BAO measure to redshift 1 with the 
BOSS spectrograph. 

A representative set of photometric surveys that might be 
available for target selection in a near future on more than 2,000 
square degrees are : 



• The Kilo Desree Survey (KIDSj^ 

aims at observing 1500 
square degrees in the ugri bands with 3cr depth of 24.8, 25.4, 25.2, 
24.2 using the VST. 

• the South Galactic Cap U-band Sky Surve30(SCUSS) aims a 
5cr limiting magnitude of 23.0 

• the Dark Energy Survey (DES) aims at observing 5,000 square 
degrees in griz bands with 10 a depth of 24.6, 24.1, 24.3, 23.9. 
This survey does not include the u band ( [The Dark Energy Survey] 
|Collaboration|2005|[Baiierji et al.|2008| ). 

• the Large Synoptic Survey Telescope (LSST) pvezic et al.| 
|2008| ) plans to observe 20,000 square degrees in ugrizy bands with 
5 o- depth of 26.1, 27.4, 27.5, 26.8, 26.1, 24.9. 

Using such deeper photometric surveys and improved 
pipelines, it should be possible to probe BAO to redshift z = 1 .2 
in the next 6 years, e.g. by the eBOSS experiment, and to z = 1.7 in 
the next 10 years, e.g. by PFS-SuMIREor BigBOSS experiment. 



ACKNOWLEDGEMENTS 

Johan Comparat especially thanks Carlo Schimd and Olivier Ilbert 
for insightful discussions about this observational program and its 
interpretation. 

We thank the SDSS-III/BOSS collaboration for granting us 
this ancillary program. 

Funding for SDSS -III has been provided by the Alfred P. 
Sloan Foundation, the Participating Institutions, the National Sci- 
ence Foundation, and the U.S. Department of Energy Office of Sci- 
ence. The SDSS-III web site is http://www.sdss3.org/ 

SDSS-III is managed by the Astrophysical Research Con- 
sortium for the Participating Institutions of the SDSS-III Collab- 
oration including the University of Arizona, the Brazilian Par- 
ticipation Group, Brookhaven National Laboratory, University of 
Cambridge, Carnegie Mellon University, University of Florida, the 
French Participation Group, the German Participation Group, Har- 
vard University, the Instituto de Astrofisica de Canarias, the Michi- 
gan State/Notre Dame/JINA Participation Group, Johns Hopkins 
University, Lawrence Berkeley National Laboratory, Max Planck 
Institute for Astrophysics, Max Planck Institute for Extraterres- 
trial Physics, New Mexico State University, New York University, 
Ohio State University, Pennsylvania State University, University of 
Portsmouth, Princeton University, the Spanish Participation Group, 
University of Tokyo, University of Utah, Vanderbilt University, 
University of Virginia, University of Washington, and Yale Uni- 
versity. 

Based on observations obtained with MegaPrime/MegaCam, 
a joint project of CFHT and CEA/DAPNIA, at the Canada-France- 
Hawaii Telescope (CFHT) which is operated by the National Re- 
search Council (NRC) of Canada, the Institut National des Sci- 
ence de rUnivers of the Centre National de la Recherche Scien- 
tifique (CNRS) of France, and the University of Hawaii. This work 
is based in part on data products produced at TERAPIX and the 
Canadian Astronomy Data Centre as part of the Canada-France- 
Hawaii Telescope Legacy Survey, a collaborative project of NRC 
and CNRS. 

The BOSS French Participation Group is supported by 
Agence Nationale de la Recherche under grant ANR-08-BLAN- 
0222. 



^ ^ http://kids.strw.leidenuniv.nl/ 
http://batc.bao.ac.cn/Uband/ 



Investigating Emission Line Galaxy Surveys with the Sloan Digital Sky Survey Infrastructure 15 



140 




redshift 

Figure 13. Photometric redshift distributions obtained using the ugri bands. 
The dashed black lines are the low and high density goals mentioned in 
Section [2] h = 10'^ and 3 x lO'^ /z^Mpc"^. The dashed red line is the 
BOSS CM ASS sample. The solid blue line is the distribution enhanced by 
the ugri selection, it has a projected sky density of ~ 340 deg~^. The solid 
red line is the gri selection (projected sky density ~ 350 deg~^). The solid 
green is the ugr selection (projected sky density ~ 400 deg~^). It shows 
the possibility of making a selection able to sample [0.6, 1.2] for a BAO 
experiment. 
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APPENDIX A: TABLE OF A SUBSAMPLE OF OBSERVED 
GALAXIES AT Z > 0.6 
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Figure Al. wgr-selected ELGs at z < 0.5. The spectra are represented in fx convention. The coloured image on the left is courtesy of CFHT cutout service. 
The black and white image in the center is the ^-band. The observed object is the one in the centre of the images. 
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Figure A2. wgr-selected ELGs at 0.5 < z < 1.0. The spectra are represented in fx convention. The coloured image on the left is courtesy of CFHT cutout 
service. The black and white image in the center is the ^-band. The observed object is the one in the centre of the images. 
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Figure A3, wgr-selected ELGs at 1.0 < z < 1.6. The spectra are represented in fx convention. The coloured image on the left is courtesy of CFHT cutout 
service. The black and white image in the center is the ^-band. The observed object is the one in the centre of the images. 
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Figure A4. wgr-selected quasars at 1.0 < z < 2.3. The spectra are represented in convention. The coloured image on the left is courtesy of CFHT cutout 
service. The black and white image in the center is the ^-band. The observed object is the one in the centre of the images. 
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